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O We reconsider a class of heterotic string theories studied in 1989, based on tensor products of 

N = 2 minimal models with asymmetric simple current invariants. We extend this analysis from 
(2, 2) and (1, 2) spectra to (0, 2) spectra with SO{10) broken to the Standard Model. In the latter 



o 

case the spectrum must contain fractionally charged particles. We find that in nearly all cases 
^ at least some of them are massless. However, we identify a large subclass where the fractional 

^ charges are at worst half-integer, and often vector-like. The number of families is very often 

^ reduced in comparison to the 1989 results, but there are no new tensor combinations yielding 

H three families. All tensor combinations turn out to fall into two classes: those where the number 

of families is always divisible by three, and those where it is never divisible by three. We find 
an empirical rule to determine the class, which appears to extend beyond minimal N = 2 tensor 
products. We observe that distributions of physical quantities such as the number of families, 
singlets and mirrors have an interesting tendency towards smaller values as the gauge groups 
approaches the Standard Model. We compare our results with an analogous class of free fermionic 
models. This displays similar features, but with less resolution. Finally we present a complete 
scan of the three family models based on the triply-exceptional combination (1, 16*, 16*, 16*) 
identified originally by Gepner. We find 1220 distinct three family spectra in this case, forming 
610 mirror pairs. About half of them have the gauge group SU{3) x SU{2)l x SU{2)r x ^7(1)^, 
the theoretical minimum, and many others are trinification models. 
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1 Introduction 



In 1989 one of us co-authored a paper pQ on tensor products of = 2 minimal models 
(also known as "Gepner models"), applied to the construction of heterotic strings. In this 
paper the original construction of Gepner [2] was generalized from diagonal (or charge 
conjugation) modular invariants to products of simple current invariants. There have 
been other studies of such generalizations but they were limited to (2, 2) conformal 

field theories with families in the representation (27) of Eq. In fl] some other ideas 
were considered: the possibility to break space-time and world-sheet supersymmetry in 
the bosonic (left-moving) sector of the heterotic string, and the possibility to break the 
gauge group Eq to 50(10), SU{5) or SU{3) x SU{2) x f/(l). Of all these possibilities, 
only one was systematically studiec|^for all 168 combinations of minimal models, namely 
breaking left-moving space-time supersymmetry, which results in breaking Eq to 5*0(10). 
The additional breaking of world-sheet supersymmetry was only studied in one example, 
namely the (1, 16*, 16*, 16*) Gepner model, where the asterisk indicates the use of an 
exceptional modular invariant. This tensor product was singled out because it was the only 
one known to give rise to spectra with three families |5] (see also [6] for the corresponding 
Calabi-Yau construction). The breaking of 50(10) to the Standard Model was merely 
mentioned in the conclusions of pQ, where the authors said about these options: "We 
hope to come back to this in the future" . 

Indeed, the issue of 50(10) breaking to the Standard Model was discussed shortly 
after in [7], and led to the disappointing conclusion that the spectrum of such theories 
necessarily contains fractionally charged particle^ This generalized an earlier result j8J 
for Calabi-Yau compactifications. The appearance of fractionally charged particles is not 
necessarily a phenomenological disaster, since such fractionally charged particles may be 
massive, or confined by non-SM interactions, but it does destroy the beautiful under- 
standing of absence of fractional charges based on SU{5) or 50(10) GUT models. 

The purpose of the present paper is to return to the work left unfinished in 1989, and 
consider these issues again in the light of new insights, new theoretical methods as well 
as vastly improved computational power. 

An important change in perspective that has taken place since then is certainly what is 
now called the string theory landscape [9] [10] . While it was clear to some people already 
a few years before 1989 that string theory was not leading to a unique gauge theory, and 
that this was a feature, and not a bug [HI |T2j, the full scope of the set of possibilities 
was not understood (and may well be largely unknown even now). Meanwhile several new 
fertile areas in the landscape have been discovered and explored, most notably orientifolds 
(for a review see [H] ) and F-theory [T5l [T6] . 

^The results, referred to as (1, 2)-models in [T], are available as scanned pdf files via the website 
www.nikhef.nl/^t58, on the page "Hodge numbers" . They were obtained using a nearly saturated random 
search. On the same page one can find a recent complete enumeration of all (2,2) Gepner models obtained 
by means of simple currents. 

^Throughout this paper, "fractional charge" refers to electric charges of elementary or QCD-composite 
color singlets. In other words, to representations of the Lie algebra SU{3) x SU{2) x U{1) that are not 
representations of the group S{U{3) x U{2)). 
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Although these developments make it more likely that the Standard Model (as we 
know it today) can actually be realized in terms of string theory or M-theory, perhaps 
paradoxically, they also cast some doubts upon the original evidence in favor of string 
theory as a theory of all interactions. In 1984, the encouraging discovery was that Grand 
Unified Theories emerge naturally from heterotic strings, merely by compactifying them 
to four dimensions. However, we know now that heterotic strings are only part of the 
story, and that in all other string constructions GUTs only arise because one imposes 
GUT unification as a condition. Consider for example orientifold models. In a subclass 
of these models (characterized by "x = 0" in the terminology of [i7\), SU{5) GUTs are 
possible. However, instead of putting five branes on top of each other to get SU{5), 
one can also take two stacks of three and two at different positions. Not surprisingly, 
the latter configuration is realized much more frequently. In another class of orientifold 
models, the x = \ models that include the "Madrid" configuration [TH] and that dominate 
the RCFT orientifold landscape, approximate gauge coupling unification can only arise 
as a coincidence. 

So there is some tension between GUTs and strings. On the one hand, the afore- 
mentioned result on charge quantization [7] ruins the nice GUT explanation precisely in 
the one context were Grand Unification appears to arise naturally in string theory: the 
heterotic string. But in other string constructions, such as a; = orientifolds, F-theory, 
or higher level affine Lie algebras, the GUT explanation for absence of fractional charges 
does work, but it is imposed by hand. In other words, in string theory one cannot have it 
both ways: a natural explanation for GUTs, plus a natural explanation for charge quanti- 
zation. Of course GUT coupling convergence and charge quantization are extrapolations 
of low energy empirical observations. Perhaps they are simply wrong, and perhaps fu- 
ture experimental results reveal evidence for fractional charges, or counter-evidence for 
coupling unification. But if we take these empirical observations as real indications of 
fundamental physics, perhaps we ought to conclude that they point to an entirely differ- 
ent kind of theory, where unification is a fundamental feature, as it once seemed to be in 
the heterotic string. 

The common phenomenological answer to these objections is that we should merely 
try to find a string theory that fits the data, no matter how rare or unnatural it may seem 
in the full set of possibilities. Indeed, finding an exact string theory realization of the 
Standard Model would be tremendously important. But of course such an achievement 
becomes less and less impressive as the known landscape becomes larger and larger. Fur- 
thermore there is a serious danger in dismissing apparent counter-evidence too quickly. If 
some feature - such as the absence of fractional charges or the correct number of families 
- is difficult to get, this fact may provide essential information. It may indicate that 
one is considering the wrong class of string theories or that one is seeing artifacts of an 
approximation. It may also indicate that we are missing some essential part of the under- 
lying physics, and that the rare feature is enhanced by some mechanism beyond current 
control, such as moduli stabilization. 

In certain cases, a rare feature may be needed for the existence of observers. In the 
context of the string landscape it is nearly inevitable that the (poorly named) "anthropic 
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principle", in some form will play a role in determining the features we observe: other 
universes could be possible, and many of them would be lifeless. The most conservative 
use of that principle is that one should not worry too much about rare features if they 
have a potential anthropic explanation. Any such explanation necessarily consists of two 
parts: an understanding of how a certain feature is distributed in the landscape, and an 
understanding of how changing it affects the existence of life. Both these issues can be 
studied scientifically. Nevertheless, the mere mention of the word "anthropic" tends to 
generate controversy. For a detailed discussion of these issues and the attitudes towards 
them we refer to [13]. Of course, if all of the above fails we may ultimately have to 
conclude that we simply live in a universe that, for no known reason, is rare in certain 
respects, but that is the least attractive option. 

In this paper we will focus on two issues: fractional charges and the number of fam- 
ilies. Neither of these is totally immune to anthropic considerations. Fractional charges 
can affect the evolution of matter in the universe and the chemistry and nuclear physics 
upon which life is based in numerous ways. Perhaps their most important generic fea- 
ture is that they are stable (although in some cases they may be bound by non-Standard 
Model interactions into integer charge bound states, which need not be absolutely stable, 
analogous to the proton). Stable additional matter will affect all abundances of parti- 
cles, will change big bang nucleosynthesis, the expansion of the universe and the energy 
balances of stars and nuclei, and will drastically alter chemistry, just to mention a few 
obvious consequences. In extreme cases, it may even be possible for the electron to de- 
cay into two or more light fractionally charged particles. Even then, one is still facing 
the question whether or not different kinds of life could be built out of the fractional 
charges themselves. There is a bewildering number of possibilities for the masses and 
interactions of fractionally charged matter, and it is a major challenge to come up with a 
convincing general argument against their existence, especially since additional fractional 
charges complicate physics substantially. But it seems implausible that the answer to the 
question: "why haven't we observed fractionally charged particles?" is: "because no life 
is possible in their presence" . 

Roughly the same can be said about the number of families: there is currently no 
argument why three families would be required for life to exist. Our kind of life requires 
at least one family, and if there are more than eight the strong coupling constant be- 
comes weaker at low energy, which is probably fatal for nuclear physics and life. Detailed 
examination of the running of all couplings, including higher loop effects, may decrease 
this interval. The existence of a large top quark mass may turn out to be crucial, since 
it has large loop effects on some couplings, and may affect weak symmetry breaking. If 
true, this would increase the minimum number of families to two. If the CKM CP-phase 
is somehow essential for life (perhaps through a presently less plausible role in baryo- 
genesis) the minimum would be three. In the absence of any anthropic explanation for 
three families, one may hope that the family distribution in the landscape peaks at the 
value three. Otherwise we may just have to accept that we ended up in a universe with 
three families just by chance. This would be still be perfectly acceptable if the family 
distribution is reasonably flat. However, previous studies of family number distributions 
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seem to suggest that in fact precisely the number three is difficult to obtain. In the 
area of orient if olds, systematic studies have been done for a variety of Standard Model 
features [121 1201 EH [IZl 122] • Although most of this work has focused on getting the Stan- 
dard Model itself, a number of conclusions can be drawn about distributions of certain 
parameters and discrete properties in its neighbourhood, such as coupling constants, the 
presence of various kinds of GUTs, the number of families, or the presence of fractional 
charges. These studies revealed a disturbing dip in the number of families, precisely for 
the observed value of 3, which occurred less frequently than 2 or 4 by two to three orders 
of magnitude [191 120]. 

In order to appreciate the foregoing anthropic considerations, suppose we were dis- 
cussing the number of colors Nc rather than the number of families. It is easy to define 
variants of the Standard Model with a different number of colors, with appropriate changes 
in the electric charges of the quarks. In orientifold models, such variations have a natural 
quiver description similar to the Madrid model, most easily obtained by starting with a 
Pati-Salam type configuration (A^^+l, 2, 1) + (iV^+T, 1, 2) of SU{N^+1) x SU{2) x SU{2). 
In orientifold models the color distribution can just as easily be studied systematically 
as the family distribution (note that in heterotic models based on S'O(10)-embeddings 
this is not possible). But for color, as opposed to the number of families, the numbers 
1, 2 and 4 lead to entirely different physics than 3. There would be no baryons at all, 
or only bosonic ones. A distribution of colors with a big dip at the value of three would 
not be a cause for concern, because the neighbouring values can be ruled out on plausible 
anthropic grounds. The same cannot be said for the value Nc = 5. However, it is likely 
that any such distribution would drop off rapidly with the number of colors, since tadpole 
cancellation requirements make it harder to accommodate larger stacks of branes. If that 
is indeed true, one can say that in the orientifold landscape the observed value = 3 
is understood by a combination of anthropic arguments and distributions. In any case it 
is much easier to accept a dip at three in the color distribution than a similar dip in the 
family distribution. 

This immediately raises the question if this dip in the number of families is an artifact 
of orientifolds or of the methods used (rational CFT or orbifolds), which is precisely one 
of the reasons for studying the same question in heterotic strings. In that area, much 
less is known so far. There is a fairly extensive amount of information on possible Hodge 
numbers or other topological data of Calabi-Yau manifolds {e.g. [23], [21]) which provides 
family distributions for Eq and other GUT gauge groups, but there is little on family 
distributions for the Standard Model gauge group for exact string constructions. This is 
even more true for the presence or absence of fractional charges in the massless spectrum. 
The only exception is the free fermionic construction, where results on distributions of 
gauge groups are available [25t 126] , and where recently some studies with a similar focus 
as the present work have been made [23 [2H1 129] . 

The fact that three families can be obtained from heterotic strings has been demon- 
strated in various constructions; some early references are for example [301 ED 1321 [331 IM]. 
In the context of Gepner models examples were found in [5], [1] and [35]. There is also 
a large body of recent work on getting as close as possible to the currently known Stan- 
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dard Model spectrum using heterotic strings (see e.g. [361 EH ESI HQl HI] and references 
therein), using combinations of string theory and effective field theory. But this is not 
what we are aiming at. The question we are addressing here is not if some of these features 
of the Standard Model can be realized (undoubtedly they can), but how rare they are. 

The work referred to in the previous paragraph uses either geometric methods or free 
conformal field theory. We wish to explore the area in between. The class we consider, 
interacting rational CFT, of which minimal N=2 tensor products are a special case, forms 
an interesting area in between free field constructions and geometric constructions. The 
former allow the easy construction of many examples, but are limited in scope, since 
a free CFT is obviously a very special case. The results of orientifold model building 
demonstrate that. Interacting CFTs provide a much richer set of models, and much 
less constrained and more generic-looking distributions of physical quantities. To make 
a similar comparison for heterotic strings, we compare our results with free fermionic 
constructions in an otherwise identical situation. Geometric methods are even broader in 
scope than interacting CFTs, but it is harder to compute exact spectra. 

This paper is the first in a series of studies of heterotic strings built using minimal 
N = 2 model tensor products, thus providing the missing heterotic counterpart to [19] and 
|17] . In the present paper we take up the line of research abandoned after [1], and study 
(0, 2) models with broken 50(10). These missing old results will set the stage for future 
work, where we will also consider breaking of the U{1)b-l group, the extra heterotic Eg 
factor and replace one or more of the minimal model building blocks in the bosonic sector. 
The class we consider is of course only a subset of all possible heterotic strings, with the 
special property that the Standard Model is embedded in the canonical way in SO (10), 
and precisely that SO{10) that is linked, via modular invariance, to the NSR model in 
the fermionic sector. This is the class of heterotic strings considered first in 1985 [12] 
[i3] . and that generated a lot of excitement because families of Eq and subgroups of Eq 
came out automatically. Meanwhile even within the area of heterotic strings many other 
possibilities have been explored. 

We will focus on two questions that remained unanswered two decades ago: Will the 
breaking of 5*0(10) and/or left-moving world-sheet supersymmetry allow us to change the 
somewhat disappointing old results regarding the number of families? With the exception 
of the three-family model found by Gepner, in all other cases the number of families was 
found to be a multiple of 6 or 4 in 1989. Breaking some symmetries may affect that, 
and indeed we will see that it does. The second question is: How do the fractionally 
charged particles appear in the spectrum? Can they be massive, or at least vector-like, 
and how often does that happen? In 1989 only a few SU{3) x SU(2) x f/(l) spectra 
were examined, and they were discarded because fractional charges were present in the 
massless spectrum. But it is not excluded that deeper searches provide examples where 
all of them are massive. Furthermore the chirality of the fractional charge representations 
was not examined. Vector-like particles exist in essentially all exact string spectra. Either 
they generically get a mass if the theory is perturbed and/or supersymmetry is broken, 
or we should conclude that string theory is wrong. But if indeed they generically do get 
masses, the same logic can be applied to vector-like fractionally charged particles. 
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This paper is organized as follows. In the next section we discuss and review the CFT 
model building aspects, define the class we are examining and explain how we scan the set 
of MIPFs. In section 3 we discuss charge quantization, and display a range of possibilities 
in between SO (10) unification and complete SO (10) breaking, and show how at least 
half-integer charges or third integral charges can be avoided without paying a price. In 
section 4 we discuss our search results. We present the relative frequencies of occurrence 
of the various possibilities for charge quantization, and present an intriguing observation 
regarding the quantization of the number of families, which appears to be valid for other 
CFT constructions (in particular Kazama-Suzuki |37] models) as well. We also present 
some distributions of non-chiral quantities (singlets and mirrors) for the different spectra 
we have found. In section 6 we present some results on the only tensor product that gives 
rise to three families, based on the original example found by Gepner [5]. In the last 
section we formulate our conclusions. 

2 CFT construction 

The class of heterotic strings considered here is defined as follows. We will make use of the 
bosonic string map [H], and we first build a four-dimensional bosonic string theory with 
the special property that its right-moving sector can be mapped to a fermionic sector. 
This bosonic string is built as a tensor product of a set of CFTs with a total central charge 
equal to 5, an Eg, level 1 affine Lie algebra, and a c = 9, iV = 2 internal CFT. The latter 
CFT is obtained by tensoring certain building blocks, which can be any N = 2 CFT. The 
c = 5 system, denoted S in the following, is required to allow a chiral algebra extension 
to D^. We start with the diagonal partition function of this entire system. 

Now we extend the chiral algebra of the right-moving sector in the following way. First 
we extend the c = 5 system to D^. All tensor products are ordinary, non-supersymmetric 
CFT tensor products, which in particular means that the set of states contains combi- 
nations of NS and R states, which violate world-sheet supersymmetry. We project all 
of these out by extending the right chiral algebra by all combinations of the vector of 

with the world sheet super currents of the N = 2 building blocks (sometimes called 
"alignment currents"). Next we extend the right chiral algebra by a combination of the 
spinor of 5*0(10) and Ramond ground states of each of the N = 2 building blocks. Finally 
we replace, in the right-moving sector, the characters of 5*0(10) x by Lorentz-covariant 
NSR characters, and we obtain a heterotic string with = 1 space-time supersymmetry. 

All the foregoing extensions are required in the right sector only. However, if we keep 
the left sector unchanged the result would not be modular invariant. The latter can 
be preserved by making exactly the same extensions in the left sector as well, but this 
automatically implies the presence of phenomenologically undesirable gauge symmetries. 
In the class of models considered here, the Standard Model gauge bosons originate from 
spin-1 currents in the CFT S. The chiral algebra extensions lead to the extension of S to 
50(10) and finally E^. 

However, the left and right algebras only need to be identical from the perspective of 
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the modular group. This allows in particular the replacement, in the left sector, of spin-1 
currents by higher spin currents, all with the same mutual monodromies. The alignment 
currents have spin 2 and have no phenomenological implications for the gauge group, and 
hence there is no need to replace them, but requiring them to be identical in the left and 
right sector is an unnecessary restriction, which we might as well remove also. If only 
the space-time supercurrent is realized asymmetrically, a (1,2) model is obtained. If the 
extension to 5*0(10) or world-sheet supersymmetry is realized asymmetrically we will call 
the resuh a (0, 2) model [lSllinill71llHlliniEniEIll32lE3]. 

In general, very little is known about asymmetric partition functions. They can be 
constructed systematically using simple currents (including free bosons as a special case) 
or free fermions. Furthermore a few exceptional examples are known, such as the mero- 
morphic c = 24 CFTs [54J (which can be freely combined in the left and right sectors). A 
more radical approach is to replace the conformal building blocks themselves by distinct, 
but isomorphic ones. An example are the "lifted" Gepner models [55j . 

In this paper we keep the same building blocks, namely N = 2 minimal models, in 
both sectors. Essentially we will do precisely what was announced in pLj, but was only 
partly completed then. The novel features of the present paper in comparison to [1] are 

• 50(10) breaking. This was only mentioned in [IJ, but not carried out because of 
computational limitations. 

• Asymmetric realizations of alignment symmetries, yielding (0, 2) spectra. In [IJ this 
was only done for a single tensor product, the (1, 16*, 16*, 16*). In all other cases 
only (2, 2) and (1, 2) spectra were considered. 

• Use of the general simple current construction described above. This formalism was 
not yet available in 1989. In [1] only products of several single current MIPFs were 
considered, a limitation which misses some cases with discrete torsion. 

To build MIPFs we make use of the formalism developed in [561 [57] . which can be 
shown to generate the most general MIPFs with non-vanishing off-diagonal multiplicities 
Zij that lie entirely on simple current orbits: Zij ^ only if i and j are on the same 
simple current orbit. A general simple current invariant is described by a set of simple 
currents forming a discrete abelian group Q, and a matrix X. The group ^ is a product 
of k cyclic factors, each generated by a current Js- The relative monodromies of these 
currents define a monodromy matrix R that is defined as Rst = Qs{,Jt) mod 1, plus a 
further constraint that fixes its diagonal elements modulo 2, depending on the conformal 
weight of the currents. The matrix X (defined modulo 1) has to satisfy the equation 

X + X"^ = R. (1) 

The matrix X determines the matrix Z{Q,X)ij in the following way: Z^j is equal to the 
number of solutions J to the conditions 

i = Ji, J E Q and 
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QK{^)+X{K, J) 



= mod 1 for all K e G 



(2) 



Here X{K^ J) is defined by writing K and J in terms of the generating currents Jg-, 
J = (Ji)"i...(Jfc)"'=, K = {Ji)'^\..{JkTK Then 

X{K, J) = risrritXst ■ 

s,t 

The group Q can be any subgroup of the full simple current group. The matrix X is 
a set of rational numbers. For each tensor product this yields a huge, but finite set of 
possibilities. For example, an = 2 minimal model at level k has a simple current 
'Z'2k+4 X Z2 for k even and Z4k+8 for k odd. For the 50(10) subgroup S we will use 
SU{3) X SU{2) X f/30 X U20 in this paper (see the next chapter). If we choose the internal 
sector (3, 3, 3, 3, 3) (which denotes the tensor product of five k = 3 minimal models), the 
total simple current group is Z3 x Z2 x Z30 x Z20 x (Z2o)^- The total number of MIPFs 
can be computed by splitting this group into prime factors, and multiplying the number 
of MIPFs corresponding to each prime factor. There exists a formula for the number of 
MIPFs for discrete groups (Zp)^, p prime [56] : 

K-l 

Nmipf= l[{l+p') 
e=o 

The generalization of this formula to factors Zpn, n > 1, is not known. In this example, 
the seven factors Z5 already contribute 1.202.088.011.709.312, the two factors Z3 multiply 
this by eight, and then there still is a factor (Z2)^ x (Z^)^ which multiplies this with 
another huge number. These numbers are reduced by the requirement of the presence of 
5*0(10), alignment and supersymmetry currents in the fermionic sector. Furthermore, in 
this particular example there will be a redundancy due to the permutation symmetry of 
the five k = 3 models. 

The building blocks of the internal CFT that we will use are the N = 2 minimal 
models. They can be combined in 168 ways to form c = 9 CFTs. In addition we consider 
the 59 CFTs obtained by using exceptional SU{2) modular invariants, which exist for 
level 10, 16 and 28. 

Finally, to put the results in perspective, we also consider N = 2 building blocks 
constructed out of free fermions. Here the c = 9 internal sector is build out of combinations 
of A^c complex and Nr real fermions {{Nc,Nr) can be (9,0), (7,4), (5,8) and (3,12)) with 
world-sheet supersymmetry realized through the so-called "triplet constraint" [SHlEniEn]- 
The symmetric (2, 2) combinations of these models were considered in [HI] as a starting 
point for orientifold constructions. In total, one may distinguish 62 different diagonal 
CFTs, which differ in the way the real and complex fermions are combined into triplets. 
This is however not the most general free fermionic construction: we only build the c = 9 
internal CFT out of free fermions (we leave Eg intact), and we exclude extensions by 
combinations of spinor currents of real fermions, because they are not simple currents. 
This is also the reason [61] why we must keep three fermions complex: otherwise we 
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cannot impose space-time supersymmetry as a simple current invarianl]?] One may also 
combine Gepner and free fermionic building blocks, but we will not consider that option 
here. 

2.1 Search procedure 

It seems essentially impossible to search the huge set of MIPFs systematically, even if 
one takes the constraints into account from the start. In the present paper we deal with 
this as follows. First we determine in the fermionic sector all character combinations that 
contribute to the massless sector, and that are local with respect to the S'O(IO) currents, 
the alignment currents and the space-time supersymmetry current. Then we generate a 
subgroup Q by choosing at random M elements of the full simple current group. Then 
we choose at random a matrix X that satisfies ([T]). We compute the matrix Z(Q,X)ij, 
act with it on the set of right-moving massless states, and read off the set of left-moving 
massless states with which they are combined. 

2.2 Mirror symmetry 

The complete set of MIPFs we consider here has an exact mirror symmetry. This is 
because we can always use a simple current to map, for any given spectrum, spinors of 
5*0(10) to their conjugates. Since we take random samples of the set of MIPFs, mirror 
symmetry is not automatic in the results, but will be approached asymptotically for large 
samples. It can therefore be used to get some idea about the level of saturation of the 
search. Another way of detecting that is to consider the frequency of occurrence of distinct 
MIPFs. If there are many that have occurred just once, it is likely that many have not 
been seen yet. These two ways of checking saturation do indeed agree: if the minimal 
occurrence frequency is larger than five, we find in all cases that mirror symmetry is exact. 
This notion of mirror symmetry should not be confused with the one used in the context 
of Calabi-Yau compactification. What we are talking about is a trivial transformation of 
the partition function, which always exists, because we allow the transformation to act 
on the NSR part of the theory. It is not trivial that a mirror map exists that acts only 
on the internal degrees of freedom. Presumably the arguments presented in [35] imply 
that it does, but we have not investigated this. We merely use mirror symmetry here as 
a bookkeeping tool. 

■^Note that there is in principle no obstacle for using exceptional MIPFs in heterotic constructions 
of this kind. One can simply multiply the modular matrices. This is how we deal with minimal model 
exceptional invariants. By contrast, in orientifold constructions exceptional MIPFs are an obstacle, 
because the boundary state formalisms requires fixed point resolution. The main problem in asymmetric 
heterotic strings is that there is no easy way to use exceptional invariants asymmetrically. If one wishes to 
do that for spinor currents of real fermions, it would be better to use directly the complete free fermionic 
formalism developed in |591 160j. Here we merely use a large subset for comparison with Gepner models. 
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3 Charge Quantization 



In the heterotic string with the SU (3) and SU (2) factors reahzed as affine Lie algebras 
at level 1, one gets conventional S'[/(5)-type coupling constant convergence if the U{1) 
factor is U^o'- the free bosons compactified on a circle with radius such that the number 
of primaries is 30. The Y charge of the quark doublet corresponds to the smallest f/30 
charge. 

The canonical way of obtaining the Standard Model from heterotic strings is to embed 
it in the level 1 5*0(10) factor that appears automatically in (2,2)-type constructions. 
Then 5*0(10) is decomposed to SU{3) x SU{2) x f/30 x f/20. To build a heterotic string 
theory using the bosonic string map explained in the previous sections, we start with 
a bosonic string with this subgroup structure symmetrically in the left- and the right- 
moving sector. Then we extend this subgroup in the right (fermionic) sector back to 
5*0(10), and then we map the 5*0(10) x Es characters to the NSR model. 

The extension of SU{3) x SU{2) x U-^o x U20 that yields ^O(IO) is of order 30, and is 
generated by the simple current (3, 2, 1, 4) (representations are denoted as (dim, dim,p, q), 
where p and q are the U30 and f/20 charges). In the bosonic sector we may encounter any 
cyclic subgroup of this extension, generated by powers of (3,2,1,4) that are divisors of 
30. The possibilities are listed in Table 1. 

Each of the eight possibilities gives rise to a subgroup of 50(10) which restricts the 
allowed Y-charges. But the Y-charges may also be restricted by higher spin currents in 
the chiral algebra, that do not lead to massless vector bosons in the spectrum. In the 
sixth column we indicate the charge quantization imposed by the gauge group, and in 
the seventh column the charge quantization imposed by the full CFT. Modular invariance 
implies that this also works in the other direction: the absence of certain fractional charges 
implies the presence of certain currents in the chiral algebra, by the same arguments as 
those used in [7] . In particular, the absence of half- integer charges implies the presence of 
the 10*^ power of the generator, whereas the absence of third-integral charges implies the 
presence of the 15**^ power. Absence of both half-integer and third-integer charges (and 
hence absence of all fractional charges) implies both the lO*'^ and the 15*^ power to be 
present, and hence by closure of the algebra also the 5**^ power. This is the root of 5f/(5). 
Unlike the 5* power, the 10*^ and the 15*'' are currents of conformal weight larger than 
1, and hence their presence does not lead to massless vector bosons. 

So we see that in this class of heterotic strings there is a very interesting, and es- 
sentially stringy, mechanism available to prevent some of the fractional charges in the 
spectrum without having Grand Unification of gauge symmetries. This mechanism can 
work to prevent either half-integer charges or third-integral charges, but not both: if both 
constraints are imposed the 5*f/(5) roots are automatically present as well. In a theory 
with one of these constraints imposed (but not both) Standard Model gauge couplings 
converge at the string scale, but this would not imply an enlargement of the gauge sym- 
metry. One can make the same mechanism work for all fractional charges but at the price 
of considering higher level affine algebras. In general, if one considers the gauge group 

SU{3)k, X 5f/(2)fc, X U2k, , 
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with a quark doublet realized as (3,2,1), it is possible to project out all fractionally 
charged states provided [7] 

ki + 9k2 + I2k3 = mod 36 

and one gets standard coupling unification if ^2 = ^3 and ki = 15^2- This has a solution 
for any integer ^2, and in all cases except /c2 = 1 that solution involves higher spin currents 
only. Unfortunately, there is a canonical construction - the one we use here - only for 
k2 = 1, and furthermore higher values of /c2 allow other unwanted states: higher tensor 
representations of SU{3) and SU{2). To our knowledge such higher level models have 
not been constructed (they should not be confused with higher level GUT models, which 
have been constructed (see e.g. [221 ESI EH ES]) in the class described here there is no 
GUT gauge group). 

It is instructive to compare with orientifold models (without any intention to suggest 
exact dualities). The fundamental problem with charge quantization in heterotic strings 
is that there is no a priori correlation between color and the Standard Model U{1). 
Both come from different Lie algebra factors. Grand Unification imposes the required 
correlation at the price of introducing extra massless gauge bosons. Orientifold models 
force us to impose such a correlation from the start. The reason is that we must get the 
quark doublet (3,2, |) from a bifundamental, and hence inevitably the f/(l) charge must 
come from either the U{3) stack or the U{2) stack. This makes a correlation between color 
and charge arise in a much more natural way. On the other hand, in orientifold models 
it is less clear why one family should have the structure we observe, whereas in heterotic 
strings this follows from the GUT embedding. However, in both cases we are asking the 
same question: if we require the existence of standard quarks and leptons in the spectrum, 
does this imply or allow the existence of other SU{3) x SU{2) x f/(l) representations that 
do not belong in a Standard Model family? 

In orientifold models, imposing the existence of the (3, 2, |) representation requires 
the Y charge to receive the following contributions from the U{1) generators Qa and 
of the U{3) and U{2) stack 

Y = {x-^)Qa + {x-^)Q,, + ... , 

Here the quark doublet is assumed to be a {V, V*) bifundamental. The parameter x is 
determined by the requirement that all charged particles in a family can be obtained. 
Under very general conditions (spelled out in [17]) the following possibilities exist. If all 
Standard Model particles are realized using orientable configurations, no constraint on x 
is obtained. In that case, all Standard Model particles are {V,V*) bi-fundamentals, and 
if each brane stack s contributes xQs to Y, all these contributions cancel. If one of the 
Standard Model particles originates from a {V, V) bi-fundamental or a rank-2 tensor, then 
X is either or ^. We then get the following possibilities for fractional charges: 

• X = 0. Then all states in the perturbative spectrum respect Standard Model charge 
quantization. This class includes SU{5) GUT. 
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• x = 2- Then all strings between the observable sector branes respect charge quan- 
tization, but open strings between hidden sector and observable sector branes lead 
to half-integer charge particles. If there is no hidden sector, Standard Model charge 
quantization holds. This class includes the Pati-Salam model SU (4) x SU{2) x 
SU{2). 

• Any other value of x (although x is usually constrained by the requirement that Y 
does not acquire a mass from axion mixing). Then Standard Model charge quan- 
tization holds if the entire observable sector brane configuration is orientable, and 
there is no hidden sector. Non-orientable matter introduces fractional charges ±2x, 
observable-hidden matter gives rise to fractional charges ±x. This class includes 
the trinification model S'f/(3)^ for a; = |. This value is special since there is no 
contribution to Y from the color stack, and the existence of this configuration is not 
constrained by the F-axion mixing condition. 

The first class has some similarity with the ^2 > 1 models described above. Full SU (5) 
GUT unification is possible by having a stack of five unitary branes, but if the five 
branes are split in groups of two and three one gets something akin to the higher level 
SU{3) X SU{2) X U{1) models described above. Using orientifolds one gets simple and 
natural realizations of half-integer and third-integer charges, just as with heterotic strings. 
However, there is no natural orientifold analog of the sixth-integral fractional charges that 
occur abundantly in heterotic strings. 

Let us return to heterotic strings with k2 = 1, the case of our interest. Consider the 
various subgroups listed in table [T] In order to project out a subset of the orbit, it must be 
possible to replace some of the currents by others of the same order. This is a necessary 
condition for being able to realize the extension asymmetrically. To see the options it is 
sufficient to consider the prime order generators, which are of order 2, 3 or 5. 

If the internal CFT does not contain a simple current of order 5, the fifth order current 
(1, 1,6,4) cannot be projected out. The only current in the left-moving sector that can 
replace it is then its own conjugate, which is of no interest. The same is true for the order 
3 current (3, 1, 10,0). Furthermore, in both cases the presence of an order 5 (respectively 
3) current of non-integer conformal weight in the internal sector is sufficient to allow the 
5*0(10) current (1, 1, 6, 4) (respectively (3, 1, 10, 0)) to be projected out. One can always 
combine a suitable power of that current with the order 5 or 3 current in f/30 in order 
to get an integer spin current that is non-local with respect to the 5*0(10) currents one 
wants to project out. This would not work for integral weight order 5 or 3 currents in the 
internal sector. Although they would seem perfect candidates for replacing the 50(10) 
currents, they are local with respect to them, and hence cannot affect them. Fractional 
spin currents of order 5 or 3 are present in = 2 minimal models if A; + 2 contains a 
single factor 5 or 3, because the simple current group is Z2fc+4 x Z2 for k even and Z^k+s 
for k odd. 

For the order two current (1, 2, 15, 0) the discussion is a bit more complicated, because 
the building blocks we use always contain currents of even order. In a suitable extension, 
these can become currents of order two that could replace (1, 2, 15, 0) in the bosonic sector. 



13 



However, one must also find left images of the alignment currents, which are also of order 
two. This can all be analysed in detail, but it is easier to find out empirically. The result 
is that the order two current can be projected out in all but two cases with exceptional 
MIPFs. The reason exceptional MIPFs can behave differently is that they automatically 
imply a D-type extension by a current of order two, which adds an extra constraint to 
the set of order two currents. 

The different possibilities can be summarized as follows: 

• The currents of order 2, 3 and 5 can all be projected out. Then all groups are pos- 
sible. This happens for example for the tensor product (3, 4, 13, 13), which contains 
currents of order 2, 3 and 5. 

• The current of order 5 cannot be projected out. Hence the order in column 4 of table 
[ijmust be divisible by 5. The only allowed groups are then Nrs. 3, 5, 6 and 7 (LR, 
Pati-Salam or SO{10)). An example is the tensor product (1, 1, 1, 1, 1, 1, 1, 1, 1). In 
these models S'f/(2)/j cannot be broken. 

• The current of order 3 cannot be projected out. The only allowed groups are then 
Nrs. 2, 4, 6 and 7 in table [l] An example is (3, 3, 3, 3, 3). In these models there can 
only be half-integer and integer charges. 

• The currents of order 3 and 5 cannot be projected out. The only allowed groups are 
then Nrs. 6 and 7. Examples are (6, 6, 6, 6) and all free fermionic models. The only 
allowed groups are Pati-Salam or SO{10), and only integer and half-integer charges 
are possible. 

• The currents of order 2 and 5 cannot be projected out. The only allowed groups are 
then Nrs. 5 and 7. This happens only for the exceptional models (1, 16, 16*, 16*) 
and (1, 16*, 16*, 16*). Only integer and third-integer charges can occur, not half- 
integer and sixth-integer. These are the only cases for which the options for charge 
quantization cannot be read off directly from the fc-values of the factors, and also 
the only ones where the order two current cannot be projected out. 



4 Results 

4.1 Spectrum selection 

The spectra we have obtained have a gauge group containing SUiS) x SU{2) x f/(l) as 
a subgroup. We assume that eventually all other gauge symmetries are broken to this 
subgroup, so that only chirality with respect to this subgroup matters; this is always 
what we mean by chirality henceforth. If a spectrum contains chiral matter that does not 
fit in a Standard Model family, then there is often not even a sensible definition of the 
number of families, because the net number of chiral representations for the usual quark 
and lepton multiplets Q, U, D, L or E can be different; if on the other hand only the 
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representations Q, U, D, L and E are present, anomaly cancellation guarantees that they 
have the same chiral multiplicities. Anomaly cancellation is exact in all the spectra we 
obtain, and in particular there are no t/(l)'s whose anomalies are cancelled by the Green- 
Schwarz mechanism. This is because there is an unbroken Eq factor in the gauge group, 
which cannot contribute to a TrF TtF^ term in the anomaly polynomial. But on the 
other hand modular invariance guarantees that the TtF^ factor must have contributions 
from all gauge groups, including the unbroken Eg. 

Spectra with chiral exotics are just counted, but not distinguished. All remaining 
spectra are distinguished on the basis of the following data: 

• The combination of minimal models and the choice of exceptional MIPFs. 

• The number of families. 

• The number of Q, U, D, L, E mirror pairs, counted separately. 

• The number of singlets. 

• The total number of fractionally charged SU{3) x SU(2) x U{1) representations. 

• The SO (10) part of the gauge group (the eight possibilities listed in table [l] below). 

• The total dimension of the gauge group. 

• The actual charge quantization observed in the massless sector. This is usually 
precisely the CFT quantization in Table 1, but in very rare cases it may happen 
that the smallest allowed charges are only seen in the massive spectrum, and are 
absent from the massless spectrum. 

We ignore B — L quantum numbers and treat right-handed neutrinos just like other 
Standard Model singlets. 

For each distinct spectrum, we keep track of the number of times it occurred in a 
random search as defined in section 2.1. This leads to two ways of counting spectra: 
the total number of times it occurred, and the total number of distinct ones. We will 
refer to these as occurrence counting and spectrum counting. Only the former method is 
available for comparing exotic and non-exotic spectra, since for the exotic ones we only 
have a total count. This is therefore the method used in the next subsection. For the other 
results we have a choice. For example, one can make a distribution of the total number of 
singlets based on total occurrence, or plot the number of distinct spectra with a certain 
number of singlets. The first method favors the GUT spectra that are obtained with the 
least amount of simple current modification. Especially the simplest (2, 2) Eq spectrum 
obtained using the diagonal invariant tends to come out very frequently. Thus unmodified 
GUT spectra are overcounted relative to the others. In other words, the objects closest to 
the lamppost are most easily seen, but that does not mean that they are more numerous. 

True landscape statistics would require counting of distinct, moduli stabilized heterotic 
ground states, and this is at present not possible. All we can reasonably expect to do now 
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is to get some idea about rare and fairly generic features. We expect spectrum counting to 
be closer to actual distinct vacuum counting than occurrence counting, and hence when 
possible we favor the former method. Note that spectrum counting leads to distributions 
that do not depend on the choice of randomization. 

For all quantities except exotics we have chosen the latter method. To illustrate the 
difference between the two counting methods we show the family distribution for the 
(1, 16*, 16*, 16*) model in both ways in section 5. 

4.2 Occurrence of fractional charges 

In table |2] we list the various possibilities and their relative frequencies. We distinguish 
standard Gepner models and exceptional Gepner models, as well as free fermions. In each 
case we have selected a random set of simple currents and a random choice of discrete 
torsion parameters. For the Gepner models, we have considered simple current groups 
generated by up to four currents, whereas for free fermions we have allowed up to six 
currents. This table is based on about 64 x 10^ samples for standard Gepner models, 
8 X 10^ for exceptional ones and 130 x 10® for free fermions. In the last three columns 
we indicate which percentage of these samples belonged to a certain type of fractional 
charge realization. Obviously there are many ways of distributing random numbers over 
the space of possibilities, so these numbers just give an idea of how rare some of them 
are, and should not be confused with proper landscape statistics. 

The hues in the table are ordered according to desirability. Gauge groups closest to the 
Standard Model appear first, and cases with chiral fractional charges are at the bottom. 
We regard anything below the line as unacceptable. Unbroken SU{5) and S'O(IO) GUTs 
are unacceptable because their massless spectra contain no Higgses to break the GUT 
gauge group. One might still consider composite Higgses, but this is too far-fetched to 
consider seriously. The other extended gauge symmetries, Pati-Salam and SU{2)r, can 
in principle be broken by Higgses that are not forbidden in the massless spectrum, and 
that in general do indeed occur. 

Above the line there are five cases where some a priori allowed fractional charges are 
entirely absent from the massless spectrum (of course they must occur in the massive 
spectrum). In two of those there are no fractional charges whatsoever, and in the other 
three the fractional charge quantum is larger than expected on the basis of the algebra 
{i.e the CFT quantization listed in column 1 does not match the actual quantization for 
massless particles listed in column 2)). All of these are extremely rare in Gepner models. 
Furthermore, we have seen them only in purely non-chiral spectra. In free fermionic 
models, the only possible gauge groups are S'O(IO) and Pati-Salam. In the latter case, 
we found that in about 8% of all cases the expected half-integer charges were completely 
absent from the massless spectrum. However, most of these spectra are purely non-chiral 
as well; only about 1% of them (.071% of the total) had chiral families (either 12 or 24). 
A similar result was reported in |28j. These authors looked at a different class of free 
fermionic models, designed for getting three families, and also reported a number of cases 
of Pati-Salam models without massless fractionally charged exotics. 
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Even within the class of free fermionic models, the examples without massless frac- 
tional charges, but with a chiral spectrum are rare. In our opinion, the reason we do not 
observe fractional charges is not that our universe ended up in such a rare ground state 
just by chance. Within the heterotic landscape, a more plausible explanation is that we 
ended up in a universe with, to first approximation, vector-like fractional charges, and 
that these have the same fate as other vector-like particles, namely acquire a mass. In 
other words, there is no good reason to prefer spectra without massless fractional charges 
over those with vector-like fractional charges at our present level of approximation, but 
it is reasonable to reject those with chiral fractional charges. If we discard the SU{5) 
and 5*0(10) GUTs, the relative probability for encountering only vector-like fractional 
charges is substantial (of order 40% with our method of random sampling, but in any 
case not absurdly rare). But even this option is not free of problems. The fractionally 
charged particles must become sufficiently massive and consequently rare to have escaped 
detection so far. 

The appearance of chiral fractional charges correlates almost perfectly with the ap- 
pearance of chiral families: tensor combinations in which the number of chiral families is 
always zero never had chiral fractional charges either. There is one tensor combination 
with chiral spectra, but without chiral exotics in any of those spectra, namely (5, 5, 5, 12). 
There is also precisely one free fermion model with that property, namely the one built 
out of 12 Ising models and 3 free bosons, i.e the maximal number of real fermions we 
allow. 

4.3 The number of families 

In previous work [Ij it has been observed that the number of families for the different 
MIPFs of any given Gepner model is quantized in certain units. In that paper, for 
each model all simple current MIPFs were considered that gave rise to (2,2) and (1,2) 
compactifications. The greatest common divisor of the number of families for a given 
Gepner combination will be called the "family quantum" A. The following values for A 
were found (each exceptional modular invariant is treated as a separate category): 

120, 96, 72, 60, 48, 40, 36, 32, 24, 12, 8, 6, 4, 

plus the value 3, which only occurred for the tensor product (1, 16*, 16*, 16*), where the 
* denotes the use of an exceptional SU{2) invariant. 

If one considers additional MIPFs the value of A can only decrease. In this paper 
we extend the set of MIPFs of [1] in two ways: by allowing breaking of world-sheet 
supersymmetry in the left sector, and by allowing breaking of S'O(IO) to SU{3) x SU{2) x 
U{1) X U{1). We do indeed observe that A does go down in many cases. We have found 
the following values for A: 

12,6,2,0 

plus the aforementioned A = 3 case, which remains unchanged, and will be discussed in 
more detail below. The largest number of families we have found is 480, and the next-to 
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largest is 360. The distribution has a huge peak at zero famihes with over 550.000 distinct 
MlPFs; there are about 120.000 with 6 and 100.000 with 12 famihes and 30.000 with 24. 
Values not divisible by 6 are less common: 25.000 MlPFs with 2, 20.000 with 4 and only 
about 1200 with 3. All values quoted here count mirror pairs as two distinct spectra. 
Values not divisible by six come from different tensor combinations than those divisible 
by 6, so one should not attach too much importance to this comparison. However, it 
certainly remains true that the value 3 is disfavored. 

These values are found for the exceptional and standard Gepner models as well as for 
free fermions (where the only values we found were 0,6,12,18,24 and 48, the same as in 
[61] ) . The distribution of the values of A over the various CFTs we considered are shown 
in table [H 

Remarkably, the CFTs that give rise to A = 2 have total numbers of families that 
are never a multiple of three. For example, the combination (3,3,3,3,3) yields numbers 
of values covering all even values up to 50 (with a small, irregular tail reaching the well- 
known value 100 for the quintic Calabi-Yau manifold), but with 0, 6, 12, 18, 24, 30, 36, 
42 and 48 all missing. Furthermore, there is an easily recognizable pattern for the tensor 
products of the six combinations in this class: 

(6,6,6,6) 
(3,3,3,3,3) 
(3,6,6,18) 
(3,3,18,18) 
(3,3,12,33) 
(3,3,9,108) 

Obviously, all values of k are divisible by three, and furthermore these are the only 
combinations for which that is true. This is at this moment just an empirical observation, 
for which an explanation is still lacking. It is puzzling that it is the value of k rather 
than k + 2 that seems to determine the family quantization properties. The value k + 2 
is a divisor of the simple current order of the minimal model, and played an important 
role in the discussion of charge quantization. But it is not clear to us what fundamental 
property of the CFT the value of k relates to. 

4.3.1 Kazama-Suzuki models 

To see if there is any chance that the foregoing observation has some validity beyond 
Gepner models and free fermions we have examined some old results on Kazama-Suzuki 
models [37] . For these models the unaltered (2,2) spectrum was computed more than 
twenty years ago |66l EZ] . In principle one could treat these models in the same way as 
we treated the Gepner models: allow breaking of world-sheet supersymmetry to (0,2), 
and allow breaking of S'O(IO). However, to carry this out we need to know the exact 
CFT spectrum of these models, and not just the spectrum modulo integers obtained from 
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the coset construction. Computing this requires a straightforward, but tedious character 
decomposition in some cases, and a more tedious and not even straightforward fixed point 
resolution procedure in the other cases. To our knowledge, these results are not available 
in the literature. To compute the spectra for unmodified (2,2) models just requires the 
Ramond ground states. Even in that case, resolving the fixed points turns out to be a 
difficult matter. Cases without fixed points were considered in |66] , and with fixed points 
in [67]. Both papers also studied mixed combinations of Kazama-Suzuki and N = 2, level 
k minimal models (which are special examples of Kazama-Suzuki models). The following 
two observations hold for all examples listed in these two papers 

• If minimal models are present with k not divisible by 3, then the number of families 
is divisible by 3. 

• If minimal models are present and the number of families is not divisible by 3, then 
the values of k are divisible by 3 for all minimal models in the tensor product. 

The property "fc is divisible by 3" seems to extend to Kazama-Suzuki models even if it 
is not clear what the analog of k is. For example, the Kazama-Suzuki model^yl(2, 3, 3), 
A{1, 3, 3), A(2, 2, 6), A(l, 3, 4), A(l, 3, 9), A{1, 3, 10), A{1, 3, 12), 5(3, 3), 5(3, 7), 5(3, 8), 
5(3,9), C(3, 1) and 5(6,1) appear to have that property. All of them give a number 
of families that is not divisible by three in combination with minimal models with all 
k divisible by three. There are a few non-trivial tests of this statement because there 
are combinations of these building blocks with each other. Indeed, 5(3,3) x 5(3,3), 
5(3,3) X A(l,3,4) and A(l,3,4) x yl(l,3,4) all yield a number of families not divisible 
by three. There are a few Kazama-Suzuki models that yield a number of families not 
divisible by three all by themselves: A{2,2, 12), 5(7,2), 5(9, 1) and C(3,4). 

One can also identify some Kazama-Suzuki models whose presence implies that the 
number of families is divisible by three, which are therefore analogous to = 2 minimal 
models with k not divisible by three. These can be identified if they appear by them- 
selves, or are tensored with building blocks that all belong to the "fc divisible by 3" class. 
They include A(l,6,7), A(l,5,9), A(l,4,15), A(l,4,6), A(l,3,6), A(l,3,16), A(l,2,3), 
A(l,2,2), A(l,2,9), 5(3,12), A(2,3,5), A(3,3,3) and 5(6,6). This list provides useful 
counterexamples to some guesses, for example that all building blocks in the A{l,3,k) or 
5(3, k) series belong to the "A; divisible by 3" class. 

In Kazama-Suzuki models, if the number of families is divisible by 3, it is almost always 
also divisible by 6. Three exceptions were found in [HS], with 63, 63 and 93 families. 

One is led to conjecture that all = 2 building blocks come in two classes: those whose 
presence in the tensor product enforces a factor of three in the number of families, and 
those whose presence inhibits a factor of three. Furthermore the enforcers are dominant: 
the inhibitors can win only if no enforcers are present. In addition to the Gepner models 
with k not divisible by three and the Kazama-Suzuki models listed above, apparently also 
the free fermionic building blocks are in the enforcer class. It is fascinating to see the 
special role of the number three in this observation. Twenty years ago one might have 

^See [66] or [67] for the notation. 
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been tempted to link this to the observed family number, despite the additional factor 
of two one gets in nearly all cases. However such a relation is far less convincing in the 
full string theory landscape. In fact, even in the heterotic landscape, if we move further 
away from (2,2) models, we have founcj^ that the number three is neither favored nor 
disfavored. 

It is an interesting question what the origin is of the ubiquitous factor of six (or 
three) in these constructions. It is clearly not the number of colors, since this factor is 
seen already before considering the Standard Model subgroup of 5*0(10). A more likely 
origin is the number of compactified dimensions. This is indeed true in certain orbifold 
models, but is not clear why that would extent (with the exceptions mentioned above) to 
interacting CFTs. 

4.4 Distributions 

Apart from the two main issues we focus on in this paper, the spectra we have collected can 
of course also be used to study other quantities of interest, in particular how certain non- 
chiral quantities are distributed. Unlike charge quantization and the number of families, 
we do not expect that such quantities can be studied entirely within the context of RCFT, 
but still a few interesting observations can be made. 

In figs, [ijjs] we plot the distribution of the number of singlets, of fractional charges, 
of quark doublet (Q) mirror pairs, and of lepton doublet (L) mirror pairs. In each case 
Gepner models and free fermions are compared. On the vertical axis the number of 
distinct spectra is indicated. Note that we are plotting the number of distinct spectra 
and not their occurrence frequency. This explains why the SM, Q=^ model has a much 
larger peak than table [2] would suggest. The distributions are shown for each of the eight 
types listed in table [T| although some are too small to be visible. The plot of Q-mirrors 
is different from the others. For better visibility it has stacked histogram bars (the bars 
are on top of each other, not behind each other), so that each vertical bar is divided into 
the eight separate contributions, distinguished by colors. In these plots, type 7 (5*0(10)) 
appears at the top of each bar, and type (barely visible) at the bottom. 

The distributions of the singlet mirror pairs of types U and E are nearly indistinguish- 
able from type Q, where D is nearly identical to L, so there is no need to show these. The 
fact that the Q, U, E resp. D, L mirror distributions are almost the same undoubtedly 
finds its origin in the underlying SO(IO) structure. Indeed, in 5*t/(5) and 5*0(10) these 
distributions are automatically identical. Furthermore the current (3, 1, 10, 0) present in 
the model SM, Q=l/2 as well as the Pati-Salam model maps U to E. In all other cases 
the distributions are a priori not related, but nevertheless we find that they remain nearly 
identical, with the Q, U, E distributions peaking around or below 5 mirror pairs, and the 
D, L distributions around 20. 

^See [55| for some examples using "heterotic weight hfting". In a forthcoming paper [68] we will 
present a full analysis of models with heterotic weight lifting, with family distributions in which all 
families appear. 
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A rather remarkable property of all these distributions, with the exception of the 
fractional charges, is that they tend towards the phenomenologically correct value, zero, 
as the gauge group is broken to approach the Standard Model. For the Q, U and E mirrors, 
zero is easily reachable within the context of RCFT, for D and L mirrors this is clearly 
much harder, and for singlets this seems impossible. The results for free fermions show 
similar features, but they appear to be more affected by RCFT artifacts, in agreement 
with the expectation that Gepner models provide a richer and more accurate scan of the 
heterotic landscape. 



In figures [9] and 10 we show the distribution of the dimension of the gauge group (not 
including the "hidden" Eg) for all distinct spectra with a non-zero number of families. In 
Gepner models the minimal dimension of the gauge group is determined by the subgroup 
of 5*0 (10) that is realized, plus the number of factors in the internal tensor product, each 
of which yields at least a U{1). Furthermore the S'O(IO) (sub)group may be extended into 
the internal sector, and there may be factors that live entirely in the internal sector. The 
best-known S'O(IO) (sub)group extensions are Eq and S'f/(3)^. Each of these absorbs one 
linear combination of the minimal model f/(l)'s into the Cartan sub-algebra. We then 
get the following minimal dimensions for the various gauge groups if there are K minimal 
model factors, with K = 4, ... 9. The last column gives the absolute minimum for the 
dimension, which occurs for K = 4, the most common value. 



SM 


13 + K 


17 


LR 


15 + K 


19 


Pati-Salam 


21 + K 


25 


SU{3f 


23 + K 


27 


SU{5) 


25 + K 


29 


SO{10) 


A5 + K 


49 


Eq 


77 + K 


81 



In figure |9| the Eq and S'O(IO) peaks are clearly visible, and have a small tail towards 
larger dimensions due to the dependence on K. Also the SU{5) and Pati-Salam peaks 
are clearly distinguishable, as well as the peak at the minimal value for the number of 
dimensions, 17. The LR peak and the trinification peak are less easily identifiable; the 
former may be related to the peak at 20, whereas the latter is hidden in the noise. The 
results for free fermions in figure 10 are more difficult to interpret, because in that case 
large gauge groups are generically present. 



5 The three family case 

Despite the ubiquitous factors of six in the number of families, we have not found a 
single new tensor combination where the greatest common denominator of all family 
multiplicities is reduced to three. This leaves us with the same case first considered by 
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Gepner, the (1, 16*, 16*, 16*) tensor product. In [T] MIPFs of this tensor product were 
considered with (2, 2), (1, 2) and also (0, 2) world sheet supersymmetry, but with unbroken 
5'O(10), and MIPFs constructed as products of at most two single current ones. 

In yyj 44 distinct spectra with three families were listed, after modding out mirror sym- 
metry. In the present study we have found 1220 three-family spectra, forming precisely 
610 mirror pairs. The exact mirror symmetry suggest that this set is complete, or very 
nearly so. Also the complete set of spectra with a non- vanishing number of chiral fam- 
ilies has nearly exact mirror symmetry. It contains 1218 mirror pairs plus one unpaired 
spectrum, so that almost half of all the total number of models have three families. 

The tensor product (1, 16*, 16*, 16*) is one of just two where only the gauge groups 
LR,Q = 1/3 and S'O(IO) are possible within the c = 5 system S. These gauge groups 
may still be extended into the internal sector. In total, there are 58 such extensions, and 
the most prominent among them are Eq and SU{3)^ (also called trinification) . 

In fig. 11 we show the family distribution of all spectra, and in fig. 12 we show the 
same for the total number of occurrences. Each bar in the histogram is divided into 
three pieces {i.e the bars are stacked on top of each other, not hidden behind each other), 
indicating, from top to botton, the contribution of 5*0(10) GUT models (including Eq), 
the contribution of LR, Q = 1/3 models with more than 19 gauge bosons, and separately 
those with exactly 19 gauge bosons. The latter number is the absolute minimum possible 
in this tensor combination. It corresponds to the number of generators of SU{3) x SU{2) x 
SU(2) X U(l) plus 4 f/(l)'s from each of the minimal model factors. The intermediate 
case is labelled "trinification" since the majority has 27 gauge bosons, and are trinification 
models (though not all of them are). The maximal number of families is 48, and is reached 
by an S'O(IO) model that occurs so rarely that its bar is not visible in the plot (the total 
occurrence was about 50, compared to about 18000 for the 45 family models). Somewhat 
atypically, the diagonal MIFF does not have the largest number of families. It has 27 
(8 families and 35 mirror families), and dominates the set with 120000 occurrences, plus 
65000 for its mirror. Trinification models have a maximum of 12 families, whereas for 
LR,Q = 1/3 the maximum is 9. It is noteworthy that the number of families shifts 
towards smaller numbers as the gauge group approaches the Standard Model. This is 
presumably in agreement with expectations, such as those first expressed in |i3] . 

A comparison of the two plots illustrates the "lamppost" effect mentioned in the 
previous section. It appears evident that the basic GUT models are overrepresented in 



occurrence counting. If instead we use spectrum counting (fig. 11) we get a more or 
less exponential fall off with the number of families. The slope is considerably smaller 
than for orientifold models (c./. [191 [20]). This is a general feature, which we will study 
in more detail in a forthcoming paper [68], where family distributions that cover all 
integers are obtained. It is also remarkable that the low-family cases are dominated by 
trinification and LR,Q = 1/3, and that the three-family case is dominated by spectra 
with the smallest possible gauge group. The fact that the family distribution, using 
spectrum counting, peaks exactly at the value 3 is interesting, but not typical. In most 
cases the peak occurs at zero families (except in the class where zero is excluded because 
the number of families is not divisible by 3) because there exist algebras that do not have 
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any chiral representations, and hence only contribute at zero. 

In trinification models the Standard Model Y generator and B — L are embedded in 
SU{2)pi X U{1) LR. The resulting rank-5 algebra is extended to rank 6 by adding a suitable 
combination U{1)t of the U{1) factors from the minimal models. Then SU{2)l x SU{2) /j X 
U{1)lr X U{1)t is extended to SU{3Y by extra roots that extend into the internal sector. 
The Standard Model is embedded in the following way into the three SU(3) factors 

(3,1,1) (3,1,0) (3) 

(1,3,1) ^ (i,2,-i) + (l,l,i) (4) 

(1,1,3) 2 X (1,1,1) + (1,1, 4) (5) 

(6) 

One standard family is contained in the representation (3,3*, 1) + (3*, 1,3) + (1,3,3*), 
which in addition to the usual 15 charged quarks and leptons contains two singlets and 
a D and L mirror pair. Group theoretically other representations are allowed. In heterotic 
strings one may, and in general will, encounter also (3, 1, 1), (1, 3, 1), (1, 1, 3), (3, 3, 1), (1, 3, 3) 
and (3, 1,3). All of these give rise to third-integrally charged particles. 

It is noteworthy that trinification models can be realized in terms of bi-fundamentals, 
and therefore have a natural construction in terms of unoriented open strings. They have 
indeed been found in RCFT orientifold constructions [TTj, where they constituted less 
than 1% of the total set of three family models. 

The frequency of trinification models in the complete set is clearly seen in the distri- 



bution of the gauge group dimensions shown in fig. 13 Clearly visible are the peaks for 
LR,Q = 1/3 at 19, trinification at 27, SO{10) at 49 and Eq at 81. 

The distributions of non-chiral states are very similar to those for the full set of Gepner 
models. The number of fractionally charged states (the total number of SU (3) x SU{2) x 
U{1) representations, divided by two) peaks at 143, with a minimum at 104. For the 
LR,Q = 1/3 models the number of Q,U,E mirrors peaks at one or two with significant 
overlap with zero; but the D and L mirrors peak at 9, with a minimum value of 3. For 
the 5*0(10) models these distributions peak at larger values, as in the general case, see 
figs. |5] and [7| 

The minimal value for the number of D and L mirrors, 3, occurs precisely for trinifi- 
cation models, where these mirrors are part of the basic Standard Model family. There 
are six mirror pairs of spectra with this minimal value, which are very similar: They are 
precisely the six permutations of the following spectrum: 

3 X [(3, 3*, 1) + (3*, 1, 3) + (1, 3, 3*)] + 72 x (1, 1, 1) 
+ [2 X (3, 3, 1) + 3 X (3, 1, 3) + 3 X (1, 3, 3) + c.c] 
+ [19 X (3, 1, 1) + 19 X (1, 1, 3) + 17 X (1, 3, 1) + c.c] 

Although this spectrum is free of mirror families (ignoring the mirrors within each family), 
it does have a large number of fractionally charged vector-like states, which phenomeno- 
logically are much more problematic than mirror fermions. Unfortunately, we cannot 
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reverse this situation, and allow more mirrors but fewer fractional charges. In all cases 
the number of fractionally charged particles is more or less the same as in this example. 

The fact that all six permutations of the SU{3) factors occur is intriguing. This 
permutation symmetry is not directly due to the permutation symmetry of the three 
factors k = 16. Permutation symmetries among minimal model factors do no affect the 
spectrum. We did not investigate if other trinification models in the set we have collected 
are similarly related. 

Almost half of the three-family models have the theoretical minimal gauge symmetry 
SU (3) X SU {2)l X SU{2)ji x U{1)^. All of these have more mirror fermions than the min- 
imal trinification model. The minimal number is 12 (as opposed to 6 for the trinification 
model displayed above), namely one U, six D and five L mirrors. 

6 Conclusions 

We studied interacting CFT constructions of heterotic strings with two main questions in 
mind: how common are fractional charges in the spectrum, and how common are three 
chiral families. 

The answer to the first question is more positive than expected. Mechanisms are 
available to limit the fractional charges to half-integer or third-integer. Partly as a result 
of that, in a substantial fraction of all cases the fractionally charges are vector-like, and 
hence may acquire a mass if the spectrum is perturbed. Nevertheless, this does not 
fully restore the original attractiveness of the heterotic string, in our opinion. Absence 
of fractional charges is still more beautifully explained in field theoretical GUT models, 
as well as their string-theoretic realizations, such as F-theory. But unfortunately those 
approaches do not offer an explanation why the Standard Model would emerge via a GUT 
rather than directly. 

The answer to the second question goes in the right direction, but remains disappoint- 
ing. Previously, the number of families was - with one famous exception - a multiple 
of 6 or 4. Now we find that these numbers are typically reduced, but they are still only 
multiples of even numbers. Furthermore we find that there are two classes, depending 
on the tensor product considered: either all family multiplicities are a multiple of three, 
or none of them is divisible by three. In the complete set of models considered, three 
families occur still very rarely. However, this problem will be solved in a forthcoming 
paper [HH] , where we consider lifted Gepner models, and will arrive at the conclusion that 
three families occur roughly as frequently as two or four. 

In addition to these questions we studied distributions of various quantities, and found 
a remarkable tendency of these distributions in the right direction, in the following sense. 
As the broken gauge groups gets smaller and hence closer to SU{3) x SU{2) x U{1), all 
other quantities approach the Standard Model as well. The number of families moves 
towards smaller numbers, and a distribution emerges that peaks at zero, and has a (fairly 
slow) exponential fall-off. With our present results that conclusion holds for multiples 
of six or two families, but in [68] we will present the same conclusion for all integers. 
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Furthermore the number of singlets and mirror quarks and leptons peaks at smaller values 
as the gauge group approaches the Standard Model. For Q, U and E mirrors the peak 
is close to zero, and therefore it is not hard to get exactly zero entirely within RCFT, 
without giving vevs to scalars. For D and L mirrors zero is harder to reach, but since an 
L mirror pair has the quantum numbers of a susy Higgs pair, one might even see that as 
a positive feature. 
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Nr. 



Name 



Current 



Order 



Gauge group 



Grp. 




1 
2 
3 
4 
5 
6 
7 



SM, Q=l/6 
SM, Q=l/3 
SM, Q=l/2 
LR, Q=l/6 
SU(5) GUT 
LR, Q=l/3 
Pati-Salam 
SO(IO) GUT 



(1,1,0,0) 
(1,2,15,0) 
(3,1,10,0) 
(1,1,6,4) 
(3,2,5,0) 
(1,2,3,-8) 
(3,0,2,8) 
(3,2,1,4) 



1 
2 
3 
5 
6 

10 
15 
30 



SU{3) X SU{2) X C/(l) X U{1) 
SU{3) X SU{2) X U{1) X C/(l) 
SU{3) X SU{2) X [/(I) X U{1) 
SU{S) X 5J7(2)l X SU{2)r x ;7(1) 

^[/(S) X C/(l) 
S'I7(3) X 5C/(2)i, X SU{2)r x f/(l) 
5?7(4) X SU{2)l X 5C/(2)fl 
50(10) 



Table 1: List of all Standard Model extensions within 5*0(10) and the resulting group theory 
and CFT charge quantization (last two columns). We refer to these subgroups either by the 
label in column 1 or by the name in column 2, where "LR" stands for left-right symmetric. 
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Fractional charges 


CFT type 


Type 


Massless 


Chiral? 


Gepner 


Gepner (exc.) 


Free Fermion 


Qi\,f r\ I/O 


1 
1 

2 


NO 


0.97% 


0.30% 


- 


ci\/r I/O 
biVi, (^=i/o 


1 
i 

3 


NO 


0.42% 


0.76% 


- 


ci\/r 1 /^^ 
oM, 14=-'-/ 


1 
i 

3 


NO 


0.000023% 


- 


- 


QA/r 1 /c 
oM, C^=i/0 


i 

6 


NO 


0.23% 


0.28% 


- 


T TD 1/0 


NO 


NO 


0.0000031% 


~ 


- 


T TD 1/0 


1 
1 

3 


NO 


3.18% 


10.15% 


- 


T T3 1 /fi 

Lti, (4—^/0 


1 
1 

3 


NO 


0.0013% 


0.010% 


- 


Lti, (4—1/0 


1 
2 


NO 


0.000029% 


0.0000126% 




T R O— 1 /fi 


1 

6 


NO 


1.06% 


2.86% 




Pati-Salam 


NO 


NO 


0.001% 


0.016% 


8.43% 


Pati-Salam 


1 
2 


NO 


15.53% 


9.41% 


68.42% 


SU(5) GUT 


NO 


NO 


13.06% 


4.67% 




SO(IO) GUT 


NO 


NO 


32.7% 


32.99% 


21.63% 


SM, Q=l/2 


1 
2 


YES 


1.09% 


0.09% 




SM, Q=l/3 


1 
3 


YES 


1.63% 


0.82% 




SM, Q=l/6 


1 

6 


YES 


0.66% 


0.25% 




LR, Q=l/6 


1 
6 


YES 


4.98% 


2.86% 




LR, Q=l/3 


1 
3 


YES 


22.88% 


33.89% 




Pati-Salam 


1 
2 


YES 


1.65% 


0.78% 


1.5% 



Table 2: Allowed and observed fractional charges. In column 1 we display the CFT 
quantization as shown in Table 1. In column 2 we show the fractional charges actually 
seen in the massless sector. In column 3 we indicate if those fractional charges are chiral. 
The other three columns show the relative frequencies of these cases in standard Gepner 
models, exceptional Gepner models and free fermion models. Spectra below the horizontal 
line are phenomenologically unacceptable. 
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A 


Gepner 


Gepner (exc.) 


Free Fermions 


12 


10 


5 


5 


6 


132 


33 


18 


3 





1 





2 


5 











21 


20 


39 



Table 3: The number of cases with a given family quantum A. 
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Figure 1: Distribution of singlets for Gepner models 
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Figure 2: Distribution of singlets for Free Fermionic models 
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Figure 3: Distribution of fractional charges for Gepner models. 




Figure 4: Distribution of fractional charges for Free Fermionic models (only Pati-Salam 
model can occur in this case) 
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Figure 5: Distribution of Q-type mirrors for Gepner models. Different types are indicated 
by different colors stacked on top of each other, ranging from 50(10) on top to SM, Q = ^ 
at the bottom. 




Figure 6: Distribution of Q-type mirrors for Free Fermionic models. The color scheme is 
as in fig. [Sj but only 5*0(10) and Pati-Salam models occur. 
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Figure 9: Distribution of gauge group dimensions for Gepner models. 




Figure 10: Distribution of gauge group dimensions for Free Fermionic models. 
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Figure 11: Family distribution for the (1, 16*, 16*, 16*) tensor product based on distinct 
spectra. 
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Figure 12: Family distribution for the (1, 16*, 16*, 16*) tensor product based on total 
occurrence frequency. 
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Figure 13: Distribution of gauge group dimensions for the (1, 16*, 16*, 16*) tensor product. 



